Published online 17 December 2011 



Nucleic Acids Research, 2012, Vol 40, No. 8 3511-3523 

doi:10.1093/nar/gkrl203 



E2F7 represses a network of oscillating cell cycle 
genes to control S-phase progression 

Bart Westendorp 1 , Michal Mokry 2 , Marian J.A. Groot Koerkamp 3 , Frank CP. Holstege 3 , 
Edwin Cuppen 2 and Alain de Bruin 1 '* 

department of Pathobiology, Faculty of Veterinary Medicine, Utrecht University, 2 Cancer Genomics Center, 
Hubrecht Institute and University Medical Center Utrecht and 3 Molecular Cancer Research, University Medical 
Center Utrecht, Utrecht, the Netherlands 

Received October 17, 2011; Revised and Accepted November 17, 2011 



ABSTRACT 

E2F transcription factors are known to be 
important for timely activation of G-|/S and G 2 /M 
genes required for cell cycle progression, but 
transcriptional mechanisms for deactivation of cell 
cycle-regulated genes are unknown. Here, we show 
that E2F7 is highly expressed during mid to late 
S-phase, occupies promoters of G^S-regulated 
genes and represses their transcription. ChlP-seq 
analysis revealed that E2F7 binds preferentially 
to genomic sites containing the TTCCCGCC motif, 
which closely resembles the E2F consensus site. 
We identified 89 target genes that carry E2F7 
binding sites close to the transcriptional start site 
and that are directly repressed by short-term 
induction of E2F7. Most of these target genes are 
known to be activated by E2Fs and are involved 
in DNA replication, metabolism and DNA repair. 
Importantly, induction of E2F7 during G 0 -G^/S 
resulted in S-phase arrest and DNA damage, 
whereas expression of E2F7 during G 2 /M failed to 
disturb cell cycle progression. These findings 
provide strong evidence that E2F7 directly controls 
the downswing of oscillating G-|/S genes during 
S-phase progression. 

INTRODUCTION 

Since the discovery of E2F nearly 25 years ago as a 
biochemical activity able to bind the Adenovirus E2 
promoter and control the expression of genes involved 
in S-phase, a considerable amount of information has 
accumulated in support of a pivotal role for this protein 
family in the temporal control of gene expression during 
the cell cycle (1). Since that time, eight E2F family 
members have been identified in mammals (2). The clas- 
sical E2Fs (E2F1-6) regulate transcription of their target 



genes when bound to their promoters as dimers with a 
DP protein, whereas atypical E2Fs (E2F7-8) bind to 
promoters as homodimers or heterodimers without DP 
(3-5). At the structural level, the similitude of the 
atypical E2Fs with the classical E2Fs is limited to its 
DNA-binding domains (DBD), and here the atypical 
E2Fs are further distinguished from its relatives by 
possessing two DBD rather than one (6-11). 

Many studies have detailed the role for E2F activities in 
controlling gene expression at Gi/S, involving the activa- 
tion of genes encoding DNA replication proteins, enzymes 
responsible for DNA biosynthesis, proteins that assemble 
to form functional origin complexes and kinases that are 
involved in activation of DNA replication. In addition to 
this role for E2F, a substantial number of E2F-induced 
genes are normally regulated at G 2 of the cell cycle, 
encoding proteins known to function in mitosis (12-15). 
Consistent with these observations, global gene expression 
profiling and genome-wide promoter occupancy studies 
[chromatin immunopreciptation (ChlP)-on ChIP and 
ChlP-sequencing] have confirmed that many genes that 
are crucial for proper cell cycle progression are bona fide 
targets of E2F1, E2F4 and E2F6 (13,16-19). Despite 
the considerable progress that has been made toward 
understanding how classical E2Fs regulate the cell cycle, 
the identity of genes regulated by atypical E2Fs is still 
unknown. 

To obtain a complete understanding of the role of 
the atypical E2Fs in cell cycle control, it will require the 
identification of the full range of E2F target genes. One 
issue that may complicate attempts to determine the role 
of the individual E2Fs is that loss of one family member 
may lead to compensation by another, either as a result of 
increased levels of one family member for the other or 
replacement of one family member for the other at 
particular promoters. In fact, previous studies show that 
long-term loss of E2F7 leads to compensatory function by 
E2F8 and vice versa to ensure cell viability and survival 
of the organism (20). Furthermore, we and others 
demonstrated the existence of a direct transcriptional 
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feedback loop between E2F7/8 and E2F1 (20,21). 
Therefore, we have taken an unbiased approach of ChIP 
in combination with sequencing (ChlP-seq) that allows 
the identification of in vivo binding sites for E2F7 
without altering the ratios of the E2Fs to each other. 
E2F8 target genes could not be determined, because all 
commercial and home-made antibodies against E2F8 
were not of sufficient quality to perform ChlP-seq 
assays. To validate the obtained E2F7 targets and to 
determine their functional significance, we generated 
inducible E2F7 cell lines and evaluated the direct effects 
of short-term induction of E2F7 on target gene expression 
and cell cycle progression. 

MATERIALS AND METHODS 

Generation of cell lines 

Mouse E2F7 cDNA (Reference sequence: NM_1 78609.4) 
was amplified with primers that introduced a Hindlll site 
at the 5' and a BamHI site at the 3'-end, using Pfu poly- 
merase (Fermentas). The cDNA was then cloned into the 
pEGFP-N3 plasmid (Invitrogen) using a double digestion 
with these two enzymes, followed by ligation with T4 
ligase (Fermentas), in such a way that a C- terminal 
E2F7-EGFP fusion protein would be transcribed. In 
addition to wild-type E2F7, we used a E2F7 cDNA with 
both DNA binding domains mutated as described 
previously (20), which served as a negative control. 
Subsequently, EGFP, E2F7-EGFP and E2F7-DBD 
mutant-EGFP were digested out from the pEGFP 
plasmid by a double enzymatic digestion with Hindlll 
and Xhol, and subsequently ligated into the pcDNA4/ 
TO plasmid (Invitrogen) using T4 ligase. Successful 
cloning was confirmed with sequencing (Baseclear, 
Leiden, The Netherlands). Tet repressor-expressing 
HeLa cells (T-REx HeLa, Invitrogen) were transfected 
with these constructs, and stable clones were established 
by Zeocin selection (Invitrogen, 300ugml _1 ). The cells 
were cultured in DMEM containing 10% Tet System 
Approved fetal bovine serum (Clontech). Overexpression 
was induced by adding 0.2 jigml -1 doxycycline (Sigma) to 
the cell culture medium. Cells were synchronized by serum 
starvation followed by hydroxyurea block as following: 
first near-confluent cells were cultured for 36 h in 
serum-deprived medium (0.5% FBS). Then, cells were 
subcultured and allowed to attach to the culture dishes 
in DMEM containing 10% FBS for 3h. Subsequently, 
2mM hydroxyurea (Sigma Aldrich) was added to the 
medium for 16 h to arrest cells at the onset of S phase. 
Cells were released from the HU block by washing three 
times with PBS and adding fresh medium containing 10% 
FBS. In an additional experiment, cells were synchronized 
in G 0 by reducing the concentration of FBS to 0.1% for 
72 h. Subsequently, the serum concentration was increased 
to 10% to allow cell cycle reentry. 

E2 f7 loxP/loxP E2f8 loxP/loxP mouse embryonic fibroblasts 
were isolated and maintained as described previously (20). 
Immortalization with retroviral constructs containing 
Myc and Ras 61L was performed as described previously 
(20,22). The immortalized lines were then treated with 



retrovirus containing the Cre recombinase to generate 
E2/7/8 double knockout cells, according to standard 
methods (23). 

Flow cytometry 

For measurement of DNA contents, cells were trypsinized, 
washed with PBS, fixed with 70% ethanol and stored at 
4°C up to 1 week. Cells were washed twice with PBS, 
and then reconstituted in PBS containing 20(igml _1 
propidium iodide, 250ugml _1 RNase A and 0.1% 
bovine serum albumin (BSA). Staining and quantification 
of y-H2AX was done according to the protocol described 
by MacPhail et al. (24). Briefly, cells were fixed with 
ethanol, permeabilized with 0.1% Triton X-100 and 
stained with a rabbit polyclonal antibody directed 
against y-H2AX (NB 100-2280, Novus Biologicals). 
After washing, cells were incubated for 1 h with anti- 
rabbit conjugated to Dylight 649 (711-495-152, Jackson 
Laboratories). After re washing, cells were stained with 
ljigml -1 4 / ,6-diamidino-2-phenylindole (DAPI), and 
analyzed immediately on a flow cytometer. All samples 
were analyzed on a BD FACS Canto II or a FACScan 
flow cytometer. Cell cycle analysis was done using 
FlowJo 7.6. 

Quantitative PCR and microarray 

Isolation of RNA, cDNA synthesis and qPCR were 
performed as previously described (25). Gene expression 
was calculated using a AAC t method adapted for 
multiple-reference gene correction (26). All samples were 
corrected for two reference genes: [3-Actin, and 18S ribo- 
somal RNA. Primer sequences are provided in 
Supplementary Table S7. 

Gene expression microarrays were performed using 
human 70-mer oligos (Operon, Human V2 AROS) 
spotted onto Codelink activated slides (Surmodics 
USA). RNA amplifications, labeling and hybridizations 
were performed on an automated system (Caliper Life 
Sciences NV/SA, Belgium) with 3 ug total RNA from 
each sample, as previously described (27). Hybridizations 
were done on a HS4800PRO system (Tecan Benelux 
B.V.B.A.) using 1000 ng labeled cRNA per channel. 
Hybridized slides were scanned on an Agilent scanner 
(G2565BA) at 100% laser power, 30% PMT. After auto- 
mated data extraction using Imagene 8.0 (BioDiscovery), 
printtip Loess normalization was performed on mean 
spot-intensities (28). Four biological replicates of vehicle- 
and doxycycline-treated samples were compared. Half of 
the replicates were labeled with Cy5 against control 
RNA (Cy3) on dual channel arrays; the second half was 
analyzed with opposite dye labeling. 

Data were analyzed using ANOVA (MAANOVA, 
version 1.41). In a fixed-effect analysis, sample, array 
and dye effects were modeled. P- values were determined 
by a permutation F2-test, in which residuals were shuffled 
5000 times globally. Genes with P<0.05 after FDR 
correction and cut-off m value ( 2 log fold change) of 
±0.25 were considered significantly changed. The up- or 
down-regulated transcripts in doxycycline-treated samples 
were analyzed using the DAVID and PANTHER gene 
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ontology tools (29-31). Data from the microarray study 
are available at the Gene Expression Omnibus, accession 
number GSE33448. 

Chromatin immunoprecipitation and ChlP-sequencing 

ChIP was performed according to Millipore's EZ ChIP 
protocol, with the following specifications: cells were 
crosslinked for lOmin by 1% freshly made formaldehyde. 
Crosslinking was stopped by 5min of lOOmM glycine. 
Cells were then washed twice with PBS, and scraped in 
1 ml of ChIP lysis buffer. Lysates were sonicated for 8 min 
at maximum power on a Covaris S2 (Covaris). For 
ChlP-qPCR 1 million cells were used for each reaction; 
for ChlP-seq, 25 million cells were used for each reaction. 
IPs were performed using BSA-blocked Fast Flow 
Agarose G beads (Millipore, 16-266). A rabbit polyclonal 
E2F7 antibody (sc-66870, Santa Cruz) was used for 
immunoblotting (1:1000), conventional ChIP (1 jig per 
reaction) and ChlP-seq (10 ug per reaction). 

To improve signal-to-noise ratios in the ChlP-seq 
experiments, a double-IP protocol was used. Antibody- 
chromatin complexes were eluted from the beads using 
200 (il of buffer containing 1% SDS and lOOmM 
NaHC0 3 , and dissolved in 10 ml of ChIP dilution 
buffer. The IP was then repeated with 5 ug antibody, 
and processed according to the EZ ChIP protocol. 
Library construction and sequencing were performed as 
described previously (32). Briefly, immunoprecipitated 
chromatin was sheared, end-repaired, sequencing 
adaptors were ligated and the library was amplified by 
ligation mediated PCR (LMPCR). After LMPCR, the 
library was purified and checked for the proper size 
range and for the absence of adaptor dimers on a 2% 
agarose gel, barcoded and sequenced on SOLiD/AB 
sequencer in multiplexed way to produce 50-bp long 
reads. Sequencing reads were mapped against the 
reference genome (hgl9 assembly, NCBI build 37) using 
BWA package (33). Multiple reads mapping to same 
location and strand have been collapsed to single read 
and only uniquely placed reads were used for peak-calling. 
Cisgenome (34) was used for peak-calling from the 
ChlP-seq data. Motif analysis was performed using 
CisModule function (35), using the following parameters: 
motif number K= 15, mean motif length Lambda = 10, 
maximal motif length allowed =18, Initial motif 
length =10, initial module size D = 3.0, module 
length = 100, order of background Markov chain = 3, 
MCM iteration = 500. A combination of custom PERL 
and R scripts and Cisgenome functions were used for 
additional data analysis. Data from the ChlP-seq study 
are available at the Gene Expression Omnibus, accession 
number GSE32673. 

Immunoblotting 

Cells were harvested by washing twice with PBS, and 
scraped in a lysis buffer containing 50 mM Tris-HCl, 
ImM EDTA, 150mM NaCl, 0.25% deoxycholic acid, 
1% Nonidet-P40, and protease inhibitors (Roche). Cells 
were lysed on ice for 20 min, and centrifuged for 10 min at 
12000g. The supernatants were then immunoblotted with 



standard SDS-PAGE techniques. The following anti- 
bodies were used: E2F7 (Santa Cruz sc-66870), E2F1 
(sc-193), GFP (sc-9996), RAD 51 (sc-8349), oc-Tubulin 
(Sigma- Aldrich T5168), y-Tubulin (T6557), mouse IgG 
HRP-linked whole Ab (GE Healthcare, NA931), rabbit 
IgG HRP-linked whole Ab (NA934). Visualization was 
done by ECL (GE Healthcare RPN2106) and exposure 
to a film (GE Healthcare). 

RESULTS 

Genome-wide mapping of E2F7 binding sites 

To identify novel E2F7 binding sites in promoters of 
target genes, we performed ChlP-seq experiments. Given 
that E2F7 expression is regulated in a cell cycle-dependent 
manner (6,7), we first determined when E2F7 protein 
levels peak during the cell cycle to provide sufficient 
enrichment required for ChlP-seq experiments. To this 
end, HeLa cells were synchronized at the beginning of 
S-phase by arresting cells in the presence of hydroxyurea 
(HU). Upon removal of the drug, these cells then progress 
through S-phase, G 2 , and mitosis, as evaluated by 
propidium iodide staining and flow cytometry (Supple- 
mentary Figure SI). Immunoblotting showed that E2F7 
protein levels peaked at 4-6 h after HU removal, when 
cells are in mid to late S-phase (Figure 1A). For compari- 
son, E2F1 protein levels peaked at 0-4 h during early 
S-phase and decreased afterward, when E2F7 accumulates 
maximally. These findings are in line with a feedback 
mechanism between these two transcription factors, 
where E2F1 activates E2F7 expression and E2F7 represses 
E2F1 expression (20). Utilizing siRNAs targeting against 
either E2F7 or E2F1, we confirmed antibody specificity 
(data not shown). Based on these expression data, cell 
lysates were harvested 6h after HU release to perform 
ChlP-seq experiments with an antibody specific for 
endogenous human E2F7. A sequencing library was 
created, and after mapping and filtering, 5 647 197 
unique reads were used for peak calling. In addition, we 
sequenced a parallel sample of input DNA from the same 
cells as a control to ensure that enriched sites are not an 
artifact of the processing and sequencing. Thus, ChIP 
peak scores were normalized corresponding input peak 
scores during the peak calling algorithm, resulting in 
1107 significantly enriched E2F7 peaks. Almost 50% of 
the filtered peaks were located within 1 kb from annotated 
transcription start sites (Supplementary Figure S2). 
A distance of maximum 5kb up- or downstream from a 
transcription start site was taken as a cut-off for identifi- 
cation of putative E2F7 target genes. Gene ontology (GO) 
analysis on the 737 genes identified this way 
(Supplementary Table SI) showed highest fold enrich- 
ments for pathways related to DNA metabolism, repair 
and replication (Supplementary Table S2). In fact, every 
enriched category was directly related to the cell cycle, 
with the exception of the GO clusters 'mRNA splicing' 
and 'Pre-mRNA processing'. Among the 737 genes 
identified, many known E2F target genes were present, 
such as CDC6, MCM2 and RAD 51 (Figure IB and 
Supplementary Table SI). 
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Figure 1. Identification of E2F7 target genes by ChlP-seq analysis. (A) E2F7 and E2F1 immunoblots on HU-synchronized HeLa cells. 
Alpha-Tubulin immunoblots served as loading control. (B) Examples of E2F7 ChIP peaks near transcription start sites of three different genes. 
Chromosome coordinates are indicated above the peaks. The bottom tracks show UTRs, coding regions, and introns indicated by thin or thick 
boxes, and lines respectively. The direction of transcription is from left to right in all shown genes. (C) ChlP-qPCR confirmation of a panel of target 
genes in HU-synchronized HeLa cells. ChlPs were performed using an E2F7 antibody or non-immune IgGs as controls. ChIP qPCR on a 
non-specific region 700 bp upstream of E2F binding sites of the E2F1 promoter (E2F1 exon 1) served as negative control for unspecific binding. 
Graphs represent average ± SD (n = 2) (D) Position weight matrix of highest overrepresented motif within E2F7 peaks compared with random DNA 
regions. (E) Distribution of the E2F7 binding motif among ChlP-seq peaks, based on proximity to nearest transcription start site (TSS). 



Nucleic Acids Research, 2012, Vol 40, No. 8 3515 



E2F7 mRNA 



B 




0 4 8 12 24 hours DOXY 
E2F7-EGFP 



E2F1 
y Tubulin 




9 1 5 10 
Fold enrichment 



Ontology term: 
Cell cycle 
Cell cycle control 
DNA replication 
DNA repair 
DNA metabolism 

Nucleoside, nucleotide and nucleic acid metabolism 

DNA recombination 

Meiosis 

Mitosis 

Chromosome segregation 
Chromatin packaging and remodeling 
Pre-mRNA processing 
mRNA splicing 



Figure 2. Validation of E2F7 target genes by microarray analysis 
after doxycycline-induced E2F7 overexpression. (A) qPCR analysis 
for E2F7 gene expression in cell lines stably expressing doxycycline 
inducible E2F7-EGFP (E2F7), E2F7-DBDmut-EGFP (DBDmut), or 
EGFP alone, cultured either with doxycycline (DOXY) or vehicle for 
24 h. Data represent average ± SEM in = 3), asterisks indicate P < 0.05 
versus vehicle (B) GFP and E2F1 immunoblots on HeLa cell lysates 
harvested at different time points after doxycycline-induced E2F7- 
EGFP expression. Immunoblotting for y-tubulin served as loading 
control. (C) Venn diagram to show overlap between down- (red) and 
upregulated (green) transcripts identified by gene expression microarray 
after 8h E2F7 overexpression and transcripts related to genes with 
a E2F7 ChIP peak (blue) within 5kb from their transcription start 
site (TSS). (D) Percentages of genes containing a ChIP peak within 
5kb from the TSS, categorized according to regulation in gene expres- 
sion microarray after E2F7 overexpression, i.e., downregulated 
(DOWN), no change (NC), or upregulated (UP). Error bars represent 
standard deviation of 1000 randomized datasets with resampled gene 
names. Triple asterisk indicates P< 0.001 calculated from permutation 
test. (E) Heat map of PANTHER gene ontology enrichments in micro- 
array and ChlP-seq showing similar enrichments of gene ontology 
clusters. Only clusters with Benjamini scores <0.05 were considered 
significantly enriched. 



We confirmed binding of E2F7 to promoters of these 
genes specifically during S-phase using ChIP followed by 
quantitative PCR (Figure 1C). Interestingly, we found 
that E2F7 occupied the promoters of E2F1, E2F2, E2F3 
and E2F8 as well as its own promoter, supporting our 
previous studies showing that atypical E2Fs can regulate 
the transcription of the classical activator E2Fs 
(Supplementary Table SI) (6,20). Recent ChlP-seq 
studies showed that E2F4 binds to promoters of all 
E2Fs and all pocket proteins (18). Here, we found that 
E2F7 also occupied the promoter of pi 07 (RBL1), but not 
the promoters of other pocket proteins or other classical 
E2F repressors (E2F4-6). 

E2F7 binds to the consensus E2F site 

A de novo motif search of E2F7-enriched DNA regions 
yielded a motif closely resembling the classic E2F- 
binding consensus sequence TTTSSCGC (Figure ID). In 
total, 15 motifs were found, of which 7 were enriched 
> 2-fold over random DNA regions (Supplementary 
Table S3). Without exception, these seven motifs are 
known binding sites for other transcription factors, such 
as NF-Y and SP1. A motif resembling the classic consen- 
sus E2F binding site showed by far the highest enrichment 
(6-fold; Figure ID and Supplementary Table S3). 
Single or multiple copies of this E2F7 binding site motif 
were present in 70% of the peaks located within 5kb 
from a transcriptional start site, but only in 20% of the 
more distant peaks (Figure IE). 

E2F7 represses transcription of Gl/S genes 

To complement the ChlP-seq studies described above, 
we overexpressed E2F7 and analyzed the global gene 
expression by microarray s. We generated HeLa cell lines 
stably expressing doxycycline-inducible E2F7 constructs 
tagged with enhanced green fluorescent protein (EGFP). 
Cell lines containing only EGFP constructs, or E2F7 with 
mutated DNA binding domains (DBDmutant) construct 
served as negative controls. In absence of doxycycline, 
stably transfected E2F7 cell lines did not show increased 
E2F7 mRNA levels compared to EGFP only cell lines, 
confirming that the inducible expression system was not 
leaky (Figure 2A). A robust increase in E2F7 protein was 
seen as soon as 4h after initiation of doxycycline treat- 
ment (Figure 2B). To identify direct targets of E2F7, 
we induced E2F7 expression for 8h, when E2F7 protein 
just reached maximum levels without yet reducing protein 
levels of E2F1, a known direct target of E2F7 (Figure 2B) 
(20). Thus indirect transcriptional effects through an 
E2F1 feedback loop could be ruled out. Given the 
strong enrichment of E2F7 on promoters of genes 
involved in DNA replication, we performed expression 
studies in an HU-synchronized cell population. HeLa 
cells were treated for 16 h with HU and during the last 
8h of the HU-synchronization E2F7 expression was 
induced by adding doxycycline. We then performed gene 
expression analysis on four biological replicates of doxy- 
cycline- versus vehicle- treated cells by microarrays. 
Three hundred and thirty-four transcripts were signifi- 
cantly decreased, whereas only 80 were increased 
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(Supplementary Table S4). GO analysis of the down- 
regulated transcripts showed that the most strongly 
enriched pathways were related to the cell cycle, especially 
DNA replication, repair, and metabolism (Supplementary 
Table S5). Analysis of the upregulated transcripts yielded 
no significantly enriched pathways. 

To determine whether there is an overlap between 
E2F7 targets identified in our ChlP-seq and microarray 
experiments, we performed comparative analysis on both 
data sets. Remarkably, 89 transcripts (27%) that were 
downregulated after E2F7 overexpression also had an 
E2F7 peak within 5kb from its transcription start 
site, whereas upregulated genes showed no overlap 
(Figure 2C and D). These genome- wide studies provide 
strong evidence that E2F7 acts strictly as a transcriptional 
repressor. The widespread overlap in targets points to the 
high degree of specificity achieved in our ChlP-sequencing 
and microarray experiments. On the basis of these 
findings, we propose that these 89 transcripts are the top 
E2F7 gene targets (Table 1). Importantly, the expression 



of the majority of these E2F7 target genes are known to 
be regulated at Gi/S phase of the cell cycle (Table 1), 
suggesting that E2F7 is critical to down-regulate these 
genes during late S-phase, at the time of the cell cycle 
when E2F7 accumulates maximally (Figure 1A). 
Functional annotation on the overlapping target genes 
derived from both genome-wide studies confirmed again 
a strong enrichment for gene products known to be 
involved in DNA replication, metabolism and repair 
(Figure 2E and Supplementary Table S6). Interestingly, 
both microarray and ChlP-seq experiments showed 
enrichment of target genes involved in mRNA processing 
and regulation of splicing (e.g. DHX15, HNRNPD and 
several SFRS isoforms), suggesting a regulatory role for 
E2F7 in controlling mRNA splicing, as described 
previously for E2F1 (36). 

We then used quantitative PCR analysis to assess 
the effect of acute induction of E2F7 on the transcription 
of target genes in a time-dependent manner. We took 
advantage of our inducible cell lines containing either 



Table 1. Identification of top E2F7 target genes by combination of ChlP-seq and gene expression microarray 
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wild-type E2F7 or E2F7-DBD -mutant. The cell lines were 
synchronized by HU block and during the last hours of 
HU synchronization E2F7 was induced by adding doxy- 
cycline for 4, 8 or 12 h. As seen in Figure 3 A, the expres- 
sion of transcripts involved in DNA replication or DNA 
repair started already to decrease after 4 h of doxycycline 
treatment. Previous studies demonstrated that E2F1, 
E2F2 and E2F3 control expression of genes involved 
in DNA replication as well as mitosis and cytokinesis 
(12-14). We therefore examined the expression of a 
panel of genes implicated in mitosis/cytokinesis, but 
failed to detect a significant decrease in gene expression 
after E2F7 induction (Figure 3A). Induction of E2F7 
DBD mutant in HeLa cells had no effect on E2F target 
gene expression (Figure 3A and Supplementary Figure 
S3). These results confirmed our genome-wide studies 
showing that transcripts involved in DNA replication, 
metabolism and repair are direct targets of E2F7, 
whereas mitotic or cytokinetic genes are not regulated 
by E2F7. 

One possibility for the failure of E2F7 to repress G 2 /M 
genes might be that we performed our studies in 
HU-arrested cells. We, therefore, induced expression of 
E2F7 during G 2 /M phase at the time when mitotic/ 
cytokinetic gene products such as CCNB1, PLK1, and 
ECT2 accumulate maximally (37-39). To this end, cells 
were first synchronized by HU, and then released from 
the HU-block for 9 h, and E2F7 expression was induced 
during the last 9h. About 70% of the cells treated with 
doxycycline were in G 2 /M phase of the cell cycle and 
expressed E2F7-EGFP (Figure 3B). Quantitative PCR 
analysis revealed that mRNA levels of CCNB1, 
PLK1 and ECT2 (Figure 3B), as well as Aurora kinase A 
and RACGAP1 (data not shown) were again not 
downregulated by E2F7 overexpression. In contrast, 
RAD51 and E2F1 mRNA were significantly decreased 
9h after hydroxyurea release. Consistent with these 
findings, we found that E2F7 overexpression in asyn- 
chronously growing cells resulted in downregulation 
of Gi/S genes but not G 2 /M genes (Supplementary 
Figure S3). 

Given the above observations, we examined whether 
genetic disruption of E2f7 increases expression of Gi/S 
genes but not G 2 /M genes. Previous studies showed 
that E2F8 can compensate for the loss of E2f7 (20). We, 
therefore, generated E2f7~^~ E2f8~ l ~ mouse embryonic 
fibroblasts and measured gene expression on a panel of 
Gi/S and G 2 /M genes by quantitative PCR. Messenger 
RNA levels of Gi/S genes were ~3-fold increased 
compared to wild-type cells, whereas mRNA levels of 
G 2 /M genes were not affected or only modestly increased 
(Figure 3C). 

Subsequently, we studied whether the increased expres- 
sion of the Gi/S genes was maintained during the later 
phases of the cell cycle. To this end, we arrested 
wild-type and E2f7^~ E2f8~ l ~ mouse embryonic fibro- 
blasts at the onset of M-phase with 16 h of nocodazole 
treatment. Again, expression of Cdc6, Cdtl and Mcm2 
was significantly increased in the E2f7 _l ~ E2f8 _l ~ cells 
(Figure 3D). 



From these results, we conclude that E2F7 directly 
represses the expression of specific Gi/S genes involved 
in DNA replication, metabolism and repair. 

E2F7 overexpression leads to S-phase arrest 

Given the above observation, we hypothesize that E2F7 
overexpression might impair DNA replication. To test this 
possibility, we induced E2F7 expression in a synchronized 
population of cells stimulated to progress through the cell 
cycle. We synchronized cells in G^S by serum deprivation 
followed by restimulation with medium containing 10% 
serum and HU. Cells were then washed and incubated 
with medium lacking HU and harvested at various times 
following HU release. E2F7 expression was induced by 
adding doxycycline to the medium 12 h before the HU 
release. Cell cycle progression was monitored by flow 
cytometry (Figure 4A). Strikingly, E2F7 overexpression 
arrested cells in early S-phase, indicating that E2F7 has 
an important role in regulating DNA replication. As 
expected, cells treated with vehicle and cells expressing 
the doxycycline inducible E2F7-DBD mutant progressed 
normally through the cell cycle and passed through 
a complete round of DNA replication within 9h after 
HU release. 

To investigate whether E2F7 overexpression also affects 
G 2 /M progression, we induced E2F7 expression at a later 
time point by adding doxycycline at the onset of 
hydroxyurea release (Figure 4B). Despite robust induc- 
tion of E2F7 protein during G 2 /M (Supplementary 
Figure S4A), cell cycle progression was not disturbed, 
supporting our previous observation that E2F7 function 
is not critical for regulating transcription of G 2 /M genes. 

To evaluate whether E2F7 overexpression also affects 
entry of quiescent cells into the cell cycle, we induced 
E2F7 expression in serum starved cells and found that 
cells enter the cell cycle but arrested in S-phase when 
cells were stimulated by serum addition (Figure 4C). 
Consistent with these observations, E2F7 overexpression 
in asynchronized cells dramatically reduced the number of 
phospho-histone-3 (PH3) positive cells, underscoring 
that E2F7 overexpression induces an S-phase arrest 
and thereby prevents entry into G 2 /M (Supplementary 
Figure S4B and C). To ensure that the observed S-phase 
accumulation was not a cell type-specific event of our 
inducible HeLa cell lines, we transfected asynchronously 
growing 293 T kidney epithelial cells with E2F7 
(Supplementary Figure S5). In line with our previous 
results, we observed a clear S-phase accumulation. 

We observed that a small population of E2F7 
overexpressing cells continued to pass through the cell 
cycle, but E2F7-EGFP flow cytometry analysis revealed 
that this population was composed of low expressing 
E2F7 cells (Supplementary Figure S6). Together these 
studies suggest that E2F7 controls S-phase progression 
through repressing E2F target genes involved in DNA 
replication and metabolism. 

E2F7 overexpression results in DNA damage 

Given the overrepresentation of DNA repair genes in the 
list of E2F7 target genes generated from our genome-wide 
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Figure 3. E2F7 represses transcription of Gi/S but not G 2 /M genes. (A) Gene expression of Gl/S- and G2/M-regulated genes involved in DNA 
replication, DNA repair and mitosis at different time point after doxycycline (DOXY) dependent induction E2F7-EGFP (E2F7) or 
E2F7-DBDmut-EGFP in HU arrested cells, analyzed by qPCR. Graphs represent average ± SEM (n = 3). Asterisks indicate P < 0.05 versus its 
corresponding Oh time point. (B) FACS plots show that cells treated with doxycycline or vehicle were in G 2 9h after HU release; green overlay 
indicates percentage of E2F7-GFP positive cells versus all propidium iodide stained cells. Gene expression analysis similar as in A was performed on 
Gi/S- and G 2 /M-regulated genes in cells harvested 9 h after HU release and doxycycline treatment. Data represent average ± SEM (n = 3); asterisks 
indicate P<0.05 versus vehicle. (C) Gene expression analysis of Gi/S- and G 2 /M-regulated genes in Myc/Ras transformed wild-type and 
E2f7- / -E2f8~ / - mouse embryonic fibroblasts (MEFs) by qPCR. Data represent average ± SEM (n = 5); Asterisks indicate P < 0.05 versus 
wild-type. (D) FACS plots showing a dramatic increase in 4n Myc/Ras transformed wild-type and E2fl~ l ~E2f8~ l ~ MEFs after 16 h of nocodazole 
(250ngml _1 ) Gene expression analysis of Gi/S-regulated genes in the MEFs of indicated genotypes by qPCR. Data represent average ± SEM (n = 3); 
Asterisks indicate P < 0.05 versus wild-type. 
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Figure 5. E2F7 overexpression causes DNA damage. (A) Flow 
cytometric analysis of asynchronously growing cells, where E2F7 or 
E2F7 DBD-mutant expression was induced for 24 h by adding doxy- 
cycline. Vehicle treated cells were used as controls. Cells were labeled 
with DAPI and antibodies against y-H2AX a marker for DNA 
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studies, we determined whether E2F7 overexpression 
would cause DNA damage. We show that induction 
E2F7 for 24 h resulted in increased DNA double-strand 
breaks, analyzed by flow cytometric quantification 
of fluorescent y-H2AX staining (Figure 5 A and B). 
Consistent with this observation, immunoblotting 
revealed that protein levels of Rad51 — a pivotal factor 
to repair double strand breaks by homologous 
recombination — were markedly decreased upon E2F7 
overexpression (Figure 5C). Previous work indicated 
that y-H2AX is also activated in apoptotic cells (40). 
To examine whether E2F7 overexpression causes cells to 
undergo apoptosis, we performed Annexin V staining and 
calculated the rate of apoptotic cells by flow cytometry. 
E2F7 induction for 24 h did not increase the number of 
apoptotic cells at the time when increased DNA damage 
was detected by y-H2AX staining (Figure 5D). However, 
at 48 h after E2F7 induction, we observed increased 
numbers of apoptotic cells. In line with these findings, in 
cells synchronized by serum starvation and stimulated to 
enter the cell cycle by serum addition, we detected a rising 
sub-Gi peak between 36 and 48 h after E2F7 induction by 
PI flow cytometry analysis (Figure 4D). These results 
indicate that long term E2F7 overexpression can cause 
apoptosis, possibly due to the accumulation of DNA 
damage. 



DISCUSSION 

Here, we have identified novel genomic binding sites for 
the E2F7 transcription factor in promoters of mammalian 
cells using ChlP-seq technology. However, the binding of 
a transcriptional regulator to the promoter region of 
a gene suggests that the factor has regulatory effects on 
the gene, but it is also possible that the factor does not 
fully or even partially control the gene. For this reason, 
we have identified a set of genes where E2F7 binding 
correlates with reduced gene expression after short term 
induction of E2F7. The data derived from these two 
independent genome-wide approaches were combined 
and resulted in a list of 89 genes, which we consider as 
the top E2F7 target genes and that encode gene products 
involved predominantly in DNA replication, metabolism 
and DNA repair. Inducible E2F7 overexpression arrested 
cells in S-phase, providing strong evidence for an import- 
ant role for E2F7 in controlling S-phase progression 
through transcriptional repression of its target genes. 
These findings are supported by our previous studies 
showing that ablation of atypical E2Fs in MEFs acceler- 
ates S-phase progression (20). Because, E2F7 expression 
is regulated in a cell cycle-dependent manner, the 
genome- wide studies were performed in synchronized 
cell populations, to achieve maximal expression of E2F7 
and sufficient enrichment on its targets. A possible 
disadvantage of this approach is that we select for E2F7 
target genes regulated during S-phase and might miss 
E2F7 target genes that are not expressed during this cell 
cycle phase. 

We have identified TTCCCGCC as the predominant 
E2F7 binding motif. This motif closely resembles the 
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classic E2F binding motif TTTSSCGC, where S is either a 
C or a G. Over 70% of all ChIP peaks mapping within 
5kb from a transcription start site contained at least 
one repeat of this motif. This percentage was even 
higher when considering only the 89 genes that were also 
down-regulated after E2F7 overexpression (90%). Our 
data are in contrast with recent studies utilizing 
promoter occupancy assays, which revealed that only a 
very small percentage of the regions bound in vivo by 
E2F1, E2F4 or E2F6 contained consensus E2F motifs, 
suggesting that classical E2Fs are recruited in vivo 
predominantly to non-consensus binding sites (16-19). 
We retrieved a relatively low number of E2F7-enriched 
regions in the ChlP-seq analysis, about 10-fold lower 
than reported numbers of peaks in ChlP-seq studies on 
E2F1 and E2F4 binding (17,18). Apart from differences in 
methodology or antibody characteristics, this could be 
explained by different binding properties of atypical 
E2Fs compared to E2F-DP heterodimers. In vitro 
protein-DNA binding studies suggested different 
binding specificity for combinations of E2F-DP (41). It 
is conceivable that E2F7/8 homo- and heterodimers 
also have unique DNA binding characteristics. 

A substantial number of the E2F7 target genes have 
been previously linked to E2F function. A role for 
E2F1, E2F2 and E2F3a in activation several of these 
DNA replication, metabolism and repair genes has been 
well established (23). Strikingly, the transcription of the 
majority of these E2F target genes is repressed directly by 
E2F7, suggesting that E2F7 counterbalances the tran- 
scription of E2F target genes activated by E2Fl-3a. 
Previous studies have shown that the transcription of 
these E2F target genes oscillates during the cell cycle 
with peak levels during the G\/S phase transition 
(12,38,42). Our studies show that protein levels of E2F7 
accumulate maximally at mid to late S-phase coinciding 
with the time when the expression of E2F target genes 
declines. In line with this, our previous studies 
demonstrated that transcription of E2F target genes 
such as Cdc6 increased acutely during S and G 2 when 
atypical E2Fs were deleted in synchronized MEFs (20). 
Together, these findings suggest that classical E2F activa- 
tors are essential for the upswing in the oscillating pattern 
of the cell cycle-specific expression of Gi/S genes, whereas 
the atypical E2Fs contribute directly to the downswing. 

Previous work has demonstrated that E2F activators 
also control transcription of G 2 /M-regulated genes 
encoding proteins known to function in mitosis (12-15). 
We provide substantial evidence that the expression of 
G 2 /M-regulated genes is not controlled directly by E2F7. 
First, we show that induction of E2F7 during S-G 2 /M 
does not disturb cell cycle progression, whereas induction 
of E2F7 expression during G 0 or Gi/S arrested cells in 
S-phase. Second, E2F7 protein levels dramatically 
decrease during G 2 /M indicating that E2F7 is not 
relevant during this phase of the cell cycle. Third, 
overexpression of an E2F7 degradation-resistant mutant 
that is highly expressed during G 2 /M failed to block cell 
cycle progression during mitosis (B. Westendorp and A. 
de Bruin, unpublished data). Finally, our genome- wide 
studies revealed that the majority of the G 2 /M genes are 



not repressed by E2F7 and do not contain E2F7 binding 
sites in their promoters. These observations are supported 
by a previous study showing that E2F1 but not 
E2F7 bound to promoters of CDC2 and CCNA2, two 
G 2 /M-regulated genes (7). However, it is possible that 
manipulation of E2F7 expression levels has indirect 
effects on the regulation of G 2 /M through altering expres- 
sion of Gi/S genes. In fact, we observed that long 
term overexpression of E2F7 down-regulated expression 
of both Gi/S and G 2 /M genes, suggesting that 
E2F7-mediated repression of E2F activators leads subse- 
quently to decreased expression of G 2 /M genes. This is in 
line with work from Zhu et al. (14) demonstrating that 
E2F activators link the control of Gi/S and G 2 /M 
transcription. 

Our experiments also demonstrate that E2F7 represses 
directly important players in the DNA damage response — 
most notably RAD51, CHEK1 and the BRCA1 and 
-2 genes. In line with this, E2F7 overexpression caused a 
marked increase in DNA damage. Consistent with these 
observations, Zalmas et al. (21) showed that inactivation 
of atypical E2Fs prevents cell cycle effects that occur in 
response to DNA damage. Negative feedback regulation 
of DNA repair is essential, which is illustrated by studies 
that demonstrate that inappropriate expression of the 
E2F7 targets such as RAD51 and BLM contribute to 
genomic instability (43,44). Furthermore, increased levels 
of many DNA repair genes in tumors are often associated 
with poor prognosis (45,46). 

Our previous microarray analysis on E2f7~ / ~E2f8~ / ~ 
mouse embryos demonstrated deregulation of predomin- 
ately gene products known to be activated in response to 
stresses, including hypoxia, nutrient deprivation and 
apoptosis (20). In contrast, our current genome-wide 
studies in human cells show that E2F7 is critical for 
controlling transcription of Gi/S genes and we observed 
no enrichment for genes involved in cellular stress 
responses. This discrepancy is most likely related to fact 
that E2p~ l ~E2f8~ l ~ embryos show multifocal severe 
E2F1 -dependent apoptosis and vascular defects. In 
addition, genetic disruption of atypical E2fs in mice 
leads to a severe placental phenotype (G. Leone, 
personal communication), indicating that oxygen and 
nutrient supply is reduced in E2f7~^~E2f8~^~ embryos 
resulting in activation of a cellular stress response, as we 
have previously described in Rb deficient mice (47,48). 
Furthermore, we have deleted atypical E2fs in the liver 
utilizing albumin-cre transgenic mice and found that the 
majority of the top E2F7 targets were upregulated in 
postnatal livers of E2fl~ l ~E2f8~ l ~ mice (S. Pandit and 
A. de Bruin, unpublished data). These findings point 
towards a direct role of E2F7 to control transcription of 
Gi/S genes rather than regulating stress related genes. 



SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online: 
Supplementary Tables 1-8, Supplementary Figures 1-6, 
Supplementary Methods and Supplementary Reference 
[20]. 
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